Quang Duong
🚀 AI Station
About
✨ AI/ML, MLOps, LLMOps
📝 Best Practices, Tutorial Notebooks, AI Paper Reviews
Instruction Dataset Creation for Supervised Fine-Tuning
Leveraging LLMs for creating instruction dataset for Supervised Fine-Tuning
instruction-dataset
Aug 27, 2024
7 min
Preference Dataset Creation for DPO Fine-Tuning
Leveraging LLMs for creating preference dataset for DPO Fine-Tuning
preference-dataset
Aug 27, 2024
1 min
Finetuning Qwen2.5-3B using DPO with Unsloth
Finetuning Qwen2.5-3B with DPO using Unsloth on TinyStories prefrence dataset
Finetuning
DPO
Unsloth
Qwen
Aug 24, 2024
1 min
Finetuning Qwen2.5-3B with Unscloth
Finetuning Qwen2.5-3B with SFT-Lora using Unsloth on TinyStories instruction dataset
Finetuning
LORA
Unsloth
Aug 24, 2024
1 min
No matching items